Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
We investigate whether Deep Reinforcement Learning (Deep RL) is able to
synthesize sophisticated and safe movement skills for a low-cost, miniature
humanoid robot that can be composed into complex behavioral strategies in
dynamic environments. We used Deep RL to train a humanoid robot with 20
actuated joints to play a simplified one-versus-one (1v1) soccer game. We first
trained individual skills in isolation and then composed those skills
end-to-end in a self-play setting. The resulting policy exhibits robust and
dynamic movement skills such as rapid fall recovery, walking, turning, kicking
and more; and transitions between them in a smooth, stable, and efficient
manner - well beyond what is intuitively expected from the robot. The agents
also developed a basic strategic understanding of the game, and learned, for
instance, to anticipate ball movements and to block opponent shots. The full
range of behaviors emerged from a small set of simple rewards. Our agents were
trained in simulation and transferred to real robots zero-shot. We found that a
combination of sufficiently high-frequency control, targeted dynamics
randomization, and perturbations during training in simulation enabled
good-quality transfer, despite significant unmodeled effects and variations
across robot instances. Although the robots are inherently fragile, minor
hardware modifications together with basic regularization of the behavior
during training led the robots to learn safe and effective movements while
still performing in a dynamic and agile way. Indeed, even though the agents
were optimized for scoring, in experiments they walked 156% faster, took 63%
less time to get up, and kicked 24% faster than a scripted baseline, while
efficiently combining the skills to achieve the longer term objectives.
Examples of the emergent behaviors and full 1v1 matches are available on the
supplementary website: https://sites.google.com/view/op3-socce
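The abstract credits zero-shot sim-to-real transfer partly to targeted dynamics randomization: resampling simulator physics each episode so the policy cannot overfit one set of dynamics. A minimal sketch of that idea, with parameter names and ranges invented for illustration (they are not the paper's actual values):

```python
import random

# Hypothetical randomization ranges; each episode draws fresh physics
# parameters so the policy must be robust across the whole range.
RANDOMIZATION_RANGES = {
    "joint_friction":  (0.8, 1.2),   # multiplier on nominal friction
    "motor_strength":  (0.9, 1.1),   # multiplier on nominal torque limits
    "control_latency": (0.00, 0.02), # seconds of actuation delay
}

def sample_dynamics(rng: random.Random) -> dict:
    """Draw one episode's physics parameters uniformly from each range."""
    return {name: rng.uniform(lo, hi)
            for name, (lo, hi) in RANDOMIZATION_RANGES.items()}

rng = random.Random(0)
episode_params = sample_dynamics(rng)
```

In practice the ranges are tuned ("targeted") to cover the measured variation across real robot instances rather than set as wide as possible.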
Machine Learning for the Zwicky Transient Facility
The Zwicky Transient Facility is a large optical survey in multiple filters producing hundreds of thousands of transient alerts per night. We describe here various machine learning (ML) implementations and plans to make maximal use of the large data set by taking advantage of the temporal nature of the data, and further combining it with other data sets. We start with the initial steps of separating bogus candidates from real ones and separating stars from galaxies, and go on to the classification of real objects into various classes. Besides the usual methods (e.g., based on features extracted from light curves) we also describe early plans for alternate methods including the use of domain adaptation, and deep learning. In a similar fashion we describe efforts to detect fast-moving asteroids. We also describe the use of the Zooniverse platform for helping with classifications through the creation of training samples, and active learning. Finally, we mention the synergistic aspects of ZTF and LSST from the ML perspective.
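The "usual methods" the abstract mentions classify objects from summary features extracted from light curves. A toy sketch of that feature-extraction step, with features chosen for illustration rather than taken from the ZTF pipeline:

```python
import statistics

def light_curve_features(mags):
    """Simple summary statistics over a sequence of magnitudes,
    of the kind fed to a downstream classifier."""
    return {
        "amplitude": max(mags) - min(mags),      # peak-to-peak variation
        "std": statistics.pstdev(mags),          # overall variability
        "median": statistics.median(mags),       # typical brightness
    }

feats = light_curve_features([15.0, 15.5, 14.8])
```

Real pipelines use dozens of such features (periodicity, skewness, color terms) before handing the vector to a classifier.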
The United States COVID-19 Forecast Hub dataset
Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at the county, state, and national levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages.
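One use the abstract names is building ensemble models from the Hub's probabilistic forecasts. Since each team reports predicted values at fixed quantile levels, a simple ensemble takes the median across models at each level. A sketch with invented model names and numbers:

```python
import statistics

QUANTILES = (0.25, 0.5, 0.75)  # illustrative; the Hub uses a longer fixed set

# Hypothetical submissions: predicted incident deaths at each quantile level.
model_forecasts = {
    "model_a": {0.25: 90.0, 0.5: 110.0, 0.75: 140.0},
    "model_b": {0.25: 80.0, 0.5: 100.0, 0.75: 130.0},
    "model_c": {0.25: 95.0, 0.5: 120.0, 0.75: 160.0},
}

def quantile_median_ensemble(forecasts, quantiles=QUANTILES):
    """Median across models, computed separately at each quantile level."""
    return {q: statistics.median(f[q] for f in forecasts.values())
            for q in quantiles}

ensemble = quantile_median_ensemble(model_forecasts)
```

Taking the median per quantile keeps the ensemble robust to a single outlier model while preserving a coherent predictive distribution.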
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Despite their wide adoption, the underlying training and memorization
dynamics of very large language models is not well understood. We empirically
study exact memorization in causal and masked language modeling, across model
sizes and throughout the training process. We measure the effects of dataset
size, learning rate, and model size on memorization, finding that larger
language models memorize training data faster across all settings.
Surprisingly, we show that larger models can memorize a larger portion of the
data before over-fitting and tend to forget less throughout the training
process. We also analyze the memorization dynamics of different parts of speech
and find that models memorize nouns and numbers first; we hypothesize and
provide empirical evidence that nouns and numbers act as a unique identifier
for memorizing individual training examples. Together, these findings present
another piece of the broader puzzle of trying to understand what actually
improves as models get bigger.
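The quantity under study, exact memorization, can be sketched concretely: a training example counts as memorized when the model's greedy prediction of the next token, given the example's context, reproduces the true token exactly. The "model" below is a stand-in lookup table, not a real language model:

```python
def exact_memorization_rate(predict, examples):
    """Fraction of (context, next_token) pairs the model reproduces exactly
    under greedy decoding."""
    hits = sum(1 for ctx, tok in examples if predict(ctx) == tok)
    return hits / len(examples)

# Invented toy corpus and a stand-in "model" that has memorized two examples.
training_examples = [
    ("the cat sat on the", "mat"),
    ("once upon a", "time"),
    ("four score and seven", "years"),
]
lookup = {"the cat sat on the": "mat", "once upon a": "time"}
rate = exact_memorization_rate(lambda ctx: lookup.get(ctx), training_examples)
```

Tracking this rate across model sizes and training steps is what lets the paper compare memorization speed and forgetting between models.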
Investigating Generalization by Controlling Normalized Margin
Weight norm and margin enter learning theory through the normalized margin,
i.e., the margin divided by the weight norm. Since standard neural net
optimizers do not control normalized margin, it is hard to test whether this quantity
causally relates to generalization. This paper designs a series of experimental
studies that explicitly control normalized margin and thereby tackle two
central questions. First: does normalized margin always have a causal effect on
generalization? The paper finds that no -- networks can be produced where
normalized margin has seemingly no relationship with generalization, counter to
the theory of Bartlett et al. (2017). Second: does normalized margin ever have
a causal effect on generalization? The paper finds that yes -- in a standard
training setup, test performance closely tracks normalized margin. The paper
suggests a Gaussian process model as a promising explanation for this behavior.
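The central quantity can be written down directly: the classification margin (correct-class logit minus the best competing logit) divided by a weight-norm measure. Bartlett et al. (2017) use a spectral-norm-based complexity; the sketch below substitutes a product of Frobenius norms as a simplified stand-in:

```python
import math

def margin(logits, true_class):
    """Correct-class logit minus the best competing logit."""
    others = [v for i, v in enumerate(logits) if i != true_class]
    return logits[true_class] - max(others)

def frobenius(matrix):
    """Frobenius norm of a weight matrix given as a list of rows."""
    return math.sqrt(sum(x * x for row in matrix for x in row))

def normalized_margin(logits, true_class, weight_matrices):
    """Margin divided by the product of per-layer weight norms
    (a simplified proxy for the spectral complexity in the theory)."""
    norm_product = math.prod(frobenius(w) for w in weight_matrices)
    return margin(logits, true_class) / norm_product
```

Because optimizers leave this ratio uncontrolled, the paper's experiments intervene on it directly, e.g. by rescaling weights, to probe its causal effect on generalization.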
DeepStreaks: identifying fast-moving objects in the Zwicky Transient Facility data with deep learning
We present DeepStreaks, a convolutional-neural-network, deep-learning system designed to efficiently identify streaking fast-moving near-Earth objects that are detected in the data of the Zwicky Transient Facility (ZTF), a wide-field, time-domain survey using a dedicated 47 deg^2 camera attached to the Samuel Oschin 48-inch Telescope at the Palomar Observatory in California, United States. The system demonstrates a 96–98 per cent true positive rate, depending on the night, while keeping the false positive rate below 1 per cent. The sensitivity of DeepStreaks is quantified by the performance on the test data sets as well as using known near-Earth objects observed by ZTF. The system is deployed and adapted for usage within the ZTF Solar system framework and has significantly reduced human involvement in the streak identification process, from several hours to typically under 10 min per day.
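The headline numbers (96–98 per cent true positive rate, false positive rate below 1 per cent) come from comparing classifier outputs against labelled data. A small sketch of how those two rates are computed; the sample labels are invented for illustration:

```python
def rates(labels, predictions):
    """Return (true_positive_rate, false_positive_rate) for binary labels."""
    tp = sum(1 for y, p in zip(labels, predictions) if y and p)
    fn = sum(1 for y, p in zip(labels, predictions) if y and not p)
    fp = sum(1 for y, p in zip(labels, predictions) if not y and p)
    tn = sum(1 for y, p in zip(labels, predictions) if not y and not p)
    return tp / (tp + fn), fp / (fp + tn)

# Toy evaluation set: 1 = real streak, 0 = bogus candidate.
labels      = [1, 1, 1, 0, 0, 0]
predictions = [1, 1, 0, 0, 0, 0]
tpr, fpr = rates(labels, predictions)
```

Reporting both rates matters here because real streaks are rare: a high true positive rate alone would be cheap to achieve by flagging everything.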